Reducing phylogenetic bias in correlated mutation analysis.

نویسندگان

  • Haim Ashkenazy
  • Yossef Kliger
چکیده

Correlated mutation analysis (CMA) is a sequence-based approach for ab initio protein contact map prediction. The basis of this approach is the observed correlation between mutations in interacting amino acid residues. These correlations are often estimated by either calculating the Pearson's correlation coefficient (PCC) or the mutual information (MI) between columns in a multiple sequence alignment (MSA) of the protein of interest and its homologs. A major challenge of CMA is to filter out the background noise originating from phylogenetic relatedness between sequences included in the MSA. Recently, a procedure to reduce this background noise was demonstrated to improve an MI-based predictor. Herein, we tested whether a similar approach can also improve the performance of the classical PCC-based method. Indeed, performance improvements were achieved for all four major SCOP classes. Furthermore, the results reveal that the improved PCC-based method is superior to MI-based methods for proteins having MSAs of up to 100 sequences.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

The Major Sources of Genetic Differentiation Among Apricot Latent Virus (ApLV) Isolates

Background and Aims: Apricot latent virus (ApLV) is a species within Foveavirus genus (Betaflexiviridae family, Tymovirales order). Phylogenetic analyses using different ORFs nucleotide sequences divided most ApLV isolates into two clusters. However, there is little data about the sources of genetic differentiation among ApLV isolates. Materials and Methods: Partial coat protein (CP) sequences...

متن کامل

Genetic and Phylogenetic Analysis of Adani Goat Population Based on Cytochrome B Gene

Identification of genetic characteristics is an important factor for preservation of species life. The aim of this study was to identify the genetic characteristics of the Adani goat populations based on the cytochrome b (Cyt b) gene and to detection its phylogenetic relationships with the domestic and wild goat species using NCBI database. Blood samples were taken from 12 Adani goat and subseq...

متن کامل

Codon bias patterns in photosynthetic genes of halophytic grass Aeluropus littoralis

Codon bias refers to the differences in the frequency of occurrence of synonymous codons in coding DNA. Pattern of codon and optimum codon utilization is significantly different between the lives. This difference is due to the long term function of natural selection and evolution process. Genetics drift, mutation and regulation of gene expression are the main reasons for codon bias. In this stu...

متن کامل

The correlation between synonymous and nonsynonymous substitutions in Drosophila: mutation, selection or relaxed constraints?

Codon usage bias, the preferential use of particular codons within each codon family, is characteristic of synonymous base composition in many species, including Drosophila, yeast, and many bacteria. Preferential usage of particular codons in these species is maintained by natural selection acting largely at the level of translation. In Drosophila, as in bacteria, the rate of synonymous substit...

متن کامل

Effectiveness of Mindfulness Oriented Recovery Enhancement Approach on Attentional Bias and Disability in Chronic Pain Patients

 Aims and background: Selective attention to pain-related stimuli, known as pain attentional bias (AB) can exacerbate pain, disability and undermine quality of life. The aim of this study was to determine effectiveness of mindfulness oriented recovery enhancement approach on attentional bias related to pain and disability among Chronic Pain Patients. Materials and methods: The present study was...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:
  • Protein engineering, design & selection : PEDS

دوره 23 5  شماره 

صفحات  -

تاریخ انتشار 2010